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Abstract 

We consider the temporally-correlated Multiple-Input Multiple-Output (MIMO) broadcast channel (BC) and 
interference channel (IC) where the transmitters have (i) delayed channel state information (CSI) obtained from a 
latency-prone feedback channel as well as (ii) imperfect current CSIT, obtained, e.g., from prediction on the basis 
of these past channel samples. The degrees of freedom (DoF) regions for certain (important) cases of the two-user 
broadcast and interference MIMO networks under such conditions are characterized. The proposed DoF achieving 
schemes exploit a particular combination of the space-time alignment protocol (referred to as MAT alignment) designed 
for fully outdated CSIT feedback channels together with the use of approximate zero-forcing (ZF) precoders. The 
design strategies build and extend upon recently reported techniques for the MISO BC, using compressing and 
forwarding of the residual interference left after the approximate ZF precoding in a digital fashion. 

I. Introduction 

While the capacity region of the Multiple-Input Multiple-Output (MIMO) broadcast channel (BC) was established 
in [1], the characterization of the capacity of Gaussian interference channel (IC) has been a long-standing open 
problem, even for the two-user single-antenna case. Recent progress sheds light on this problem from various 
perspectives, among which the authors in [2] characterized the degrees of freedom (DoF) region, specializing to the 
large SNR regime, for the two-user MIMO IC. Such advance nevertheless suggests achievable schemes which require 
the full knowledge of channel state information (CSI) at both the transmitter and receiver sides. In practice, however, 
the acquisition of perfect CSI at the transmitters is difficult, if not impossible, especially for fast fading channels. The 
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CSIT obtained via feedback suffers from delays, which renders the available CSIT feedback possibly fully obsolete 
(i.e., uncorrelated with the current true channel) under the fast fading channel and, seemingly non-exploitable in view 
of designing the spatial precoding. Recently, the impact of feedback delays in such scenario (referred to as "delayed 
CSIT") was considered in [3] for the MISO BC, and a scheme termed here as MAT alignment was proposed. The 
key contribution in [3] was to establish the usefulness of even completely outdated channel state information in 
designing precoders achieving strictly better DoF than what is obtained without any CSIT. The essential ingredient 
for the proposed scheme in [3] lies in the use of a multi-slot protocol initiating with the transmission of unprecoded 
information symbols to the user terminals, followed by the analog forwarding of the interference overheard in the 
previous time slots. Most recently, generalizations under the similar principle to the MIMO BC [4] and MIMO IC 
[5, 6] settings were also addressed, again establishing DoF strictly beyond the ones obtained without CSIT [7-9] but 
below the ones with perfect CSIT [2]. 

Albeit inspiring and fascinating in nature, these works made the assumption that the delayed CSIT bears no 
correlation with the current channel realization. Hence, these results pessimistically consider that no estimate for the 
current channel realization is producible to the transmitter. Owing to the finite Doppler spread behavior of fading 
channels, it is however the case in many real life situations that the past channel realizations can provide some 
information about the current one. Therefore a scenario where the transmitter is endowed with delayed CSI in addition 
to some (albeit imperfect) estimate of the current channel is of practical relevance. This form of combined perfect 
delayed and imperfect current CSIT was first introduced in [10] for the MISO BC whereby a novel transmission 
scheme was proposed which improves over pure MAT alignment in constructing precoders based on delayed and 
current CSIT estimate. The full characterization of the optimal DoF for the hybrid CSIT was reported in [11, 12] for 
MISO BC and in [13] for MISO IC. The key idea behind the schemes in [10-13] lies in the modification of the 
MAT protocol where i) the initial time slot involves transmission of precoded symbols, followed by the forwarding 
of residual interference overheard in the first time slot, and ii) the taking advantage of the reduced power for the 
residual interference (compared with full power interference in MAT) based on a suitable quantization method. 

In this paper, we extend the results in [10, 11, 13] and consider the two-user temporally -correlated MIMO networks 
both for the BC and IC cases. Note that the generalization from MISO to MIMO networks is unfortunately not 
a trivial step. The additional spatial dimension at the receiver side enables to cancel a certain amount of residual 
interference (generated by the impreciseness of transmit precoding) and thus intuitively is expected to enhance 
the achievable DoF, especially for larger number of transmit than receive antennas. Furthermore, the clarity of the 
description of DoF achieving schemes in MIMO settings is hampered by the enumeration of all possible antenna 
configurations resulting in slightly different transmission strategies as was noted previously for the purely delayed 
CSIT case [5,6]. A similar issue arises here and, indeed, working out the full DoF region for an arbitrary MIMO 
network appears difficult at this stage. In order to ease exposition, we choose to focus on one important particular 
scenario offered by so-called symmetric networks where nodes have the same number of transmit (M) and receive 
(N) antennas, with possible M ^ N. Overall, our results allow to bridge between previously reported CSIT scenarios 
such as the pure delayed CSIT of [3-6] and the pure instantaneous CSIT scenarios for the MIMO setting. We tackle 
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both the BC and IC configurations as we point out the tight connection between the DoF achieving transmission 
strategies in both these settings. 

More specifically, we obtain the following key results: 

• We establish outer bounds on the DoF region for the two-user temporally-correlated MIMO BC and IC with 
perfect delayed plus imperfect current CSIT, as a function of the current CSIT quality exponent. By introducing 
a virtual received signal for IC, we link the outer bound of IC nicely to that of BC, arriving at the same outer 
bound result for both cases. In addition to the genie-aided bounding techniques and the application of the 
extremal inequality in [10, 11, 13], we develop a set of upper and lower bounds of ergodic capacity for MIMO 
channels, which is essential for the MIMO case but not extendable from MISO. 

• We propose schemes which achieve the key vertices of the outer bound region with perfect delayed plus 
imperfect current CSIT. The schemes build on the principles of a time-slotted protocol, starting with the spatial 
zero-forcing (ZF) precoded transmission of information symbols from the transmitter(s) simultaneously and 
followed by forwarding of the residual interferences in a compressed fashion. In particular, the MIMO BC/IC 
with M = 3 and N — 2 achieves sum DoF ( 12 + 3a ). This smoothly bridges two special cases: the case with 
pure delayed CSIT (a = 0) and that with perfect current CSIT (a = 1). 

It is worth noting that our results embrace the previously reported particular cases: the perfect CSIT setting [2] (i.e., 
current CSIT of perfect quality), the pure delayed CSIT setting [5, 6] (i.e., current CSIT of zero quality), and the 
special MISO case [10, 11, 13] with N = 1. 

Notation: Matrices and vectors are represented as uppercase and lowercase letters, and matrix transport, Hermitian 
transport, inverse and determinant are denoted by A T , A H , A^ 1 and det(A), respectively, is the normalized 
orthogonal component of any nonzero vector h. The approximation f(P) ~ g(P) is in the sense of limp-^ = 
C, where C > is a constant that does not scale as P. Partial ordering of Hermitian matrices is denoted by >; and 
If!, i.e., A < B means B — A is positive semidefinite. Logarithms are in base 2 and e for log and In, respectively. 
=, < and > are in the sense of large P. 

II. System Model 

A. Two-user MIMO Broadcast Channel 

For a symmetric two-user (M, N) MIMO broadcast channel (BC) with M antennas at the transmitter and N 
antennas at each receiver, the discrete time signal model is given by 

Vi {t) = Hi(t)x(t) + Zi (t) (la) 

for any time instant t, where Hi(t) e C NxM is the channel matrix for i-th receiver (i = 1, 2); z l {t) <~ Nc (0, Ijv) 
is the normalized AWGN vector at the respective receiver and is independent of channel matrices and transmitted 
signals; the coded input signal x(t) G C Mxl is subject to the power constraint E||:c(i)|| 2 < P, Vt. Denote by 
Hi[k i: k 2 ](t) the submatrix of Hi(t) from ki-th row to fc 2 -th row (fci < fc 2 ). Define H(t) = {H\(t), H 2 (t)} as the 
ensemble of channel matrices. 
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B. Two-user MIMO Interference Channel 

For a symmetric two-user (M, N) MIMO interference channel (IC) with M antennas at each transmitter and N 
antennas at each receiver, the discrete time signal model is given by 

Vi {t) = H a {t)xx{t) + H l2 {t)x 2 {t) + Zi (t) (2a) 

for any time instant t, where Hji(t) <G C NxM = 1,2) is the channel matrix between Tx-i and Rx-j; the 
coded input signal Xi(t) <G C Mxl is subject to the power constraint E||aij(t)|| 2 < P, Vt. Denote by Hn-j(t) the 
k-th row of Hij(t) and Hi[ kl . k2 ]j(t) as the submatrix of Hij(t) from fci-th row to fc 2 -th row (k\ < k 2 ). Define 
H(t) 4 {H n (t),H 21 (t), H 12 (t), H 22 (t)}. 

For notational brevity, we hereafter refer to (M, N) MIMO BC and IC as (M, N) MIMO network, and define 
H k 4 {H(t)} k =1 , and H k 4 {H(t)} k =1 , where fc = 1, ■ ■ ■ , n. 

C. Assumptions 

Assumption 1 (fading channel). At any given time instant t, the element of channel matrix Hi{t) (resp. Hij(t)) is 
drawn from a stationary and ergodic random process, and any concatenated matrix with rows extracted from Hi(t) 
( resp. Hij (t) ) is full rank almost surely. 

Assumption 2 (perfect delayed CSIT and imperfect current CSIT). At each time instant t, the transmitter knows 
perfectly the delayed CSI H* , and obtains an imperfect estimate of the current CSI H{t), which could, for 
instance, be produced by standard prediction based on past samples. The current CSIT estimate is modeled by 

H(t) = H(t) + H(t) (3) 

where the estimate H{t) and estimation error H{t) are independent, and H{t) is independent and identically 
distributed (i.i.d.) with each entry being assumed to be zero-mean and with variance (1 — a 2 ), a 2 , respectively 
(0 < a 2 < 1). Further, we assume the following Markov chain 

(H* _1 , -H"*" 1 ) -> H* -> H\ Vi, (4) 

which means H l is independent of {H l ~ x , H 1 ^ 1 ) conditional on H l . 

Assumption 3 (perfect delayed/current CSIR, and imperfect current CSI at the receiver). At each time instant t, 
the receivers know perfectly the delayed/current CSI and somehow know the same imperfect current CSI at the 
transmitter both up to instant t for all links, i.e., H* and if*. 

Assumption 3 is in accordance with previous works with delayed CSIT, and since linear prediction can be applied 
at the receiver side as well to produce current channel estimate. It does not add any limitation over the assumption 
made in [3-5]. We point out that only local CSIT/CSIR (the channel links with which the node is connected) is 
really helpful and leads to the same result. Nevertheless, we assume the CSIT/CSIR to be available in a global 
fashion for presentation simplicity. 
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We assume that the estimation error a 2 can be parameterized as an exponential function of the power P, so that 
we can characterize the DoF region of MIMO networks with respect to this exponent. To this end, we introduce a 
parameter a > 0, such that 

a = - hm . (5) 

P^oo logP 

This a indicates the quality of current CSIT at high SNR. While a = reflects the case with no current CSIT, 
a -> oo corresponds to that with perfect instantaneous CSIT. As a matter of fact, when a > 1, the quality of 
the imperfect current CSIT is sufficient to avoid the DoF loss, and ZF precoding with this imperfect CSIT is 
able to achieve the maximum DoF [14]. Therefore, we focus on the case a € [0, 1] henceforth. The connections 
between the above model and the linear prediction over existing time-correlated channel models with prescribed 
user mobility are highlighted in [10,15]. According to the definition of the estimated current CSIT, we have 
E\h H {t)h ± {t)\ 2 = <t 2 ~ p- a , with h representing any column of channel matrices, and h being its corresponding 
estimate. 

III. Main Results on the Degrees of Freedom Region 

A rate pair (R\, R2) is said to be achievable for the two-user MIMO networks with perfect delayed CSIT and 
imperfect current CSIT if there exists a (2 nRl , 2 nR2 , n) code scheme consists of: 

• two message sets Wi = [1 : 2 nRl ] and W 2 = [1 : 2 nR2 }, from which two independent messages W\ and W 2 
intended respectively to the Rx-1 and Rx-2 are uniformly chosen; 

• one encoding function for (each) transmitter: 

BC: x(t) = f t (w 1 ,W 2 ,H t -\H t ) 

\ . x ' (6a) 

IC: Xi (t) = fi, t [Wi, H*-\ H*J ,i = l,2; 

• and one decoding function at the corresponding receiver, 



W 



j 



9j 



(Yp,H n ,H n ), j = l,2 (7) 



for the Rx-j, where Yj l = { yj (t)}? =1 , 
such that the average decoding error probability Pi n \ defined as p]™- 1 = Ew 1 eW 1 ,w 2 eV} 2 ^' 

vanishes as the code length n tends to infinity. The capacity region C is defined as the set of all achievable rate 
pairs. Accordingly, the DoF region can be defined as follows: 

Definition 1 (the degrees of freedom region). The degrees of freedom (DoF) region for the two-user MIMO network 
is defined as 

V=\{d 1 ,d 2 )ER 2 + \V(w 1 ,w 2 )Em. 2 + ,w 1 d 1 +W2d2<\imsup{ sup ^^±^2)1 (g) 
[ " p^oo \( Rl ,R 2 )ee logP J J 

Accordingly, the main results of this paper are stated as the following three theorems: 
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Theorem 1 (outer bound). For the two-user (M, N) MIMO network with perfect delayed and imperfect current 
CSIT, the DoF pair {d\,d 2 ) € M? + is outer-bounded by 

d 1 < min{M,iV}, (9a) 

d 2 <mm{M,N}, (9b) 

di d 2 min{M, 2N} - min{Af , N} 

min{M, N} min{M, 2iV} " ' min{M, 2N} 1 Cj 

rfi , d 2 min{M ; 2iV}^min{M ; jV} 

min{M, 27V} min{M, N} ~ min{M, 2iV} ' k ; 

for BC: d x + d 2 < min{M, 27V}, (9e) 
for IC: d x + d 2 < min{2M, 2N, max{M, N}}, (9f) 
vv/zere a is f/ze current CSIT quality exponent. 

Proof: Note that the first two bounds hold because the DoF of the point-to-point (M, N) MIMO channel is 
upper bounded by min{M, N}, and the sum DoF outer bounds (9e)-(9f) are from BC/IC with perfect CSIT setting 
[1,2], which should hold also for our setting. Hence, it is sufficient to prove bounds (9c) and (9d), which are 
provided in Section IV. ■ 
Remark: Interestingly, when M > N, the sum DoF outer bounds of BC and IC coincide with d\ + d 2 < 
min{M, 2iV}, and hence both BC and IC hold the same DoF outer bounds. Nevertheless, these sum DoF bounds 
are inactive when M > N as will be detailed below. 

Theorem 2 (tightness for the M < N and M > 2N cases). For the two-user (M, N) MIMO network with perfect 
delayed and imperfect current CSIT, the above outer bound of DoF region is tight when M < N and M > 2N. 

Proof: The outer bounds can be deduced from (9) and the achievable schemes are in Section V. A and V. C. ■ 
Remark: 

. When M < N, the DoF regions with no CSIT, with delayed CSIT and with perfect CSIT are identical [5, 
7-9], i.e., the weighted sum DoF bounds (9c)-(9d) are inactive, showing additional CSIT does not improve 
DoF region. 

• When M > 2N, for (M, N) MIMO network with perfect delayed and imperfect current CSIT, the optimal 
DoF region can be characterized by 

di < N, (10a) 
d 2 < N, (10b) 
d 1 +2d 2 < N{2 + a), (10c) 
2di +d 2 < N{2 + a), (lOd) 
which is N times enlarged of the MISO case [11, 13]. Note that the sum DoF bounds (9e)-(9f) are inactive 
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and redundant. For illustration, the DoF region is provided in Fig. 1. Those with pure perfect delayed CSIT 
(black), and perfect instantaneous CSIT (dashed black) are also plotted for the comparison purpose. It shows 
that the DoF region in our setting is strictly larger than that with pure delayed CSIT and approaches the region 
with perfect CSIT as the quality of current CSIT increases. 




Theorem 3 (inner bound for the N < M < 2N case). For the two-user (M, N) MIMO network with perfect 
delayed and imperfect current CSIT, the DoF region 

di<N, (11a) 

d 2 <N, (lib) 
di d 2 M-N 

i + ii^ l + — a > (llc) 

di d 2 M-N 

ii + i^ l + ^ a > (lld) 

2MN ± MiM-N)a 

M+N ' V ; 

is achievable for the case when N < M < 2N. 



s 



Proof: The necessary condition of the inner bounds (lla)-(lld) can be deduced from (9), while the sum DoF 
inner bound (lie) inactivates the sum DoF outer bounds (9e)-(9f), and the sufficient condition (i.e., the achievability) 
is provided in Section V. B. ■ 
Take (3, 2) MIMO network for instance. The achievable DoF region with mixed delayed and imperfect current 
CSIT (red) is provided in Fig. 2. The DoF regions with pure perfect delayed CSIT (black), perfect CSIT (dashed 
black) and also the outer bound (dashed red) are also plotted for comparison. Compared to the outer bound, the 
achievable DoF region is further bounded by a sum DoF constraint (cf. (lie)). 




Fig. 2: The achievable DoF region for two-user (M,N) MIMO networks (e.g., M = 3, N = 2, and a = 0.5). 



Notice that the achievable DoF region is subsumed in the outer bounds, whose sum DoF bound has not yet been 
proven. We conjecture that such achievable scheme is optimal, since it smoothly bridges two extreme cases when 
a = and a = 1. 

IV. Proofs for the Outer Bound 
To obtain the outer bounds, the following ingredients are essential: 

• Genie-aided bounding techniques by providing side information of one receiver to the other one [5]; 

• Extremal inequality to bound the weighted difference of conditional differential entropies [16]; 

• Ergodic capacity upper and lower bounds for MIMO channels [17, 18]. 



9 



We first provide the outer bounds by employing the genie-aided techniques for BC and IC, respectively, reaching 
the similar formulation of the rate bounds. With the help of extremal inequalities, the weighted sum rates are further 
bounded. Finally, the bounds in terms of a are arrived by applying the ergodic capacity bounds of MIMO channels. 
Note that we consider only the case of M > N in this section since the DoF results for M < N coincide with the 
no CSIT case and are already known [7-9]. 



A. Genie-aided Bounding Techniques 

To obtain the outer bounds, we adopt a genie-aided upper bounding technique reminisced in [4,5], by providing 
Rx-2 the side information of the Rx-l's message W\ as well as received signal Y™. As stated in Assumption 3, both 
receivers know the CSI H(t) perfectly and instantaneously as well as the imperfect current CSI H(t) at time t. 

For notational brevity, we define a virtual received signal as 

— / ,\ a \ Hi(t)x(t) + Zi(t) forBC 

Vi(t) = < (12) 
[ H i2 (t)x 2 {t) + Zi (t) forIC 

and we also define X n = {x(t)}? =1 , X? = {xi(t)}? =1 , Y t k = {yi(t)}$ =1 , and Y+ = {y 4 (t)} t fe =1 . Denote also 
ne n = 1 + nRP^ where e„ tends to zero as n — >• oo by the assumption that linin^oo Pi™' = 0. 

1) Broadcast Channel: The genie-aided model is a degraded BC X n — > (Y",Y 2 n ) — > Y™, and therefore we 
bound the achievable rates by applying Fano's inequality as 

n(R 1 -e n )<I(W 1 ;Y 1 n \H n ,H n ) (13) 

n 

= J2 I (Wi;yi(t)\H n ,H n ,Y 1 t - 1 ) + ne n (14) 

t=l 

n 

= Y / (Kyi(t)\H n ,H n ,Y 1 t - 1 )-h(y 1 (t)\H n ,H n ,Y 1 t - 1 ,W 1 )) (15) 

£=1 

n 

< h( yi (t)\H(t)) - h( yi (t)\U(t),H(t)) (16) 
t=i 

n 

< nN log P - /»(»i(t)|U(t),iT(t)) + n ■ 0(1) (17) 

t=i 

n(R 2 - e n ) < I(W 2 - 17, Y 2 n , W t \H n , H n ) (18) 
= I(W 2 ; Y™, Y 2 n \W 1} H n , H n ) (19) 

n 

= Y / nW 2 :yi(t),y 2 (t)\H n ,H n ,Y 1 t -\Y 2 t -\W 1 ) (20) 

£=1 

n 

<Y,I(x{t);yi{t),y2(t)\H n ,H n ,Yl-\Yt\W l ) (21) 
t=l 

n 

J2 (MviW^WI^.-ff",*!*" 1 ,^" 1 .^) ( 22 ) 

- h( yi (t),y 2 (t)\x(t),H n ,H n , Y*~\Yt\ ^i)) (23) 



/,=i 
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<J2Kyi(t),y2(t)\H n ,H n ,Y 1 t -\Y 2 t -\W 1 ) (24) 

i=l 

n 

= J2Hm(t),y2(t)\u(t),H(t)) (25) 
*=i 

where U(i) = {l? -1 , ? 2 t_1 , if* -1 , Ji"', Wi j, (16) is from (12) and because (a) conditioning reduces differential 
entropy, and (b) {yi(t), y 2 (*)} are independent of H™ +1 and H™ +1 ; (17) follows the fact that the rate of the 
point-to-point M x N MIMO channel (i.e., between Tx an Rx-1) with M > N is bounded by nN log P + n ■ O(l); 
(19) is due to the independence between W\ and W 2 ; (21) follows date processing inequality; and (24) is obtained 
by noticing (a) translation does not change differential entropy, (b) Gaussian noise terms are independent from 
instant to instant, and are also independent of the channel matrices and the transmitted signals, and (c) the differential 
entropy of Gaussian noise is nonnegative. 

2) Interference Channel: Given the message and corresponding channel, part of the received signal is deterministic 
and therefore removable without mutual information loss. Hence, similarly to the BC case, we formulate a degraded 
channel model, i.e., X 2 — > (Y",Y 2 n ) -» . By applying Fano's inequality, the achievable rate of Rx-1 and Rx-2 
can be bounded as 

n{R x - e n ) < HWiiYftH", H n ) (26) 
= I(W 1 ,W 2 ;Y 1 n \H n ,H n ) -I(W 2 ;Y 1 n \W 1 ,H n ,H n ) (27) 

< nN\ogP-I(W 2 ;Y 1 n \W 1 ,H n ,H n ) + n-0(l) (28) 
= nN log P — h{Y"\W u H n , H n ) + h(Y 1 n \W 1 ,W 2 , H n , H n ) + n • 0(1) (29) 
= nN log P- h(Y?\W 1 ,H n , H n ) + n • 0(1) (30) 
= nN log P- h(Y 1 n \H n ,H n ) + n • O(l) (31) 

n 

< nNlogP - /*(vi(t)|H", H n , Y*~\ l?" 1 ) + n ■ 0(1) (32) 

i=l 

n 

= nN log P - h(yi(t)\U(t),H(t)) + n • 0(1) (33) 
t=i 

n(R 2 - e„) < I(W 2 ; Y™, Y 2 , W x \H n , H n ) (34) 
= I(W 2 ; Yf, Y 2 n \W u H n , H n ) (35) 
= I(W 2 ; Yj", Y 2 n \H n , H n ) (36) 

n 

= Y / I(W 2l yi(t) 7 y 2 (t)\H n ,H n ,Y 1 t -\Y 2 t - 1 ) (37) 
t=i 



< ^/(x 2 (t);y 1 (t),w 2 (t)|jr i ,H n ,y 1 t " 1 ,'*2" 1 ) (38) 
t=l 

n 

= ^(h(yi(t),y 2 (t)\H n ,H n ,Y 1 t - 1 ,Y 2 t - 1 ) (39) 



t=i 
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-h(y 1 (t),y 2 (t)\x 2 (t),H n ,H n ,Y 1 t - 1 ,Yt 1 )) (40) 

n 

<£h{y 1 (t),ih{t)\H n ,H n ,Y*-\Y*- 1 ) (41) 

t=l 

n 

= J2Hm(t),y2(t)Mt),H(t)) (42) 
t=i 

where U(t) 4 {^*- 1 ,y 2 *- 1 ,ff* \ff*}, (28) follows the fact that the mutual information at hand is upper 
bounded by the rate of the 2M x N point-to-point MIMO channel by cooperating two transmitters, given by 
nN log P + n ■ O(l); (30) is due to the fact that (a) transmitted signal Xf is a deterministic function of messages, 
H n , and H 11 ^ 1 as specified in (6), and (b) translation does change differential entropy; (31) and (36) are obtained 
because translation preserves differential entropy; (32) is because conditioning reduces differential entropy; (41) is 
because (a) translation does not change differential entropy, (b) Gaussian noise terms are independent from instant to 
instant, and are also independent of the channel matrices and the transmitted signals, and (c) the differential entropy 
of Gaussian noise is nonnegative; and (42) is obtained due to the independence V2(t)} of H™ +1 and H™ +1 . 

It is worth noting that BC and IC share the common structure of the achievable rate bounds, and therefore can be 
further bounded in a similar way. To avoid redundancy, we give the proof for IC, which can be straightforwardly 
extended to BC. 



B. Extremal Inequalities 
Define 



S{t) 



H 12 (t) 
H 22 (t) 



S(t) 



K(t)±E{x 2 (t)x" 2 (t)\U(t)}. 



(43) 



H 12 {t) 
H 22 (t)_ 

Due to the fact that Gaussian input maximizes the weighted difference of two differential entropies under the 
covariance constraint [16], given two integers p > 0, q > 0, we have 



h( yi (t),y 2 (t)\U(t),H(t))- h(yi(t)\U(t),H{t)) 

~ c%, pfSw), (lHm(t),V2(t)\U(t),H(t))-h( yi (t)\U(t),H(t)) 

tr(C)<Pp(x 2 (t)|U(t)) 
tr(C)<P K u (t)±C 



tr(c)<p «r„ (t )xc 



(44) 
(45) 

(46) 
(47) 



= max max E S(t) s(t) Q logdct(I + S(t)K u (t)S"(t)) - i logdct(I + H 12 (t)K u (t)H" 2 (tj) 

(^\ogdet{I + S{t)K u (t)S H (t))- Uogdet(I + H 12 (t)K u (t)H H 12 (t)^j (48) 



<Ej,,, max max Eo m io/,i 
- Sl - t > cyo, p( s ( t )) SWISW 

tr(C)<P^ (t y ; c 

where (46) is obtained because Gaussian distributed vector u(t) maximizes the weighted difference of two 
differential entropies over all conditional distribution of x 2 (t) with the same covariance matrix constraint, where 
K u (t) = E{u(t)u"{t)} = max p(u(t)) K(t) when p > q [16] and w(t) = [z[{t) z T 2 (t)] T ; (47) is obtained because 
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(a) u(t) is Gaussian distributed with covariance matrix being relative to (Jf t_1 , H*) and hence independent of S(t) 

given S(t) according to the assumption of Markov chain (4), (b) yi(t) is a degraded version of y 2 (i)); and 

(48) follows the fact that putting the expectation out of the maximization increases the value. 

Letting p = min{Af, 2iV} and q = min{M, N}, we have the following two cases: 
1 ) Case N < M < 2N : Given p = M and q = N, we have 

±h{jn{t),jh{W(t),H(t)) - lh(Wi(*)|U(t),ff(t)) (49) 
<E S(t) max max E S(t)|S(t) -J^ (N log det(I + S(t) K u (t)S" (t)) - M log det(I + H 12 {t)K u (t)H H 12 {t))) (50) 

CyQ, p(S(t)) 'MIS 
tr(C)<P K u (t)±C 

M - N 

< — ^ alogP + 0(l) (51) 
where the last inequality is obtained from the following lemma: 

Lemma 1. For two random matrices S = S + S E C 2NxM and H = H + H E C NxM (N < M < 2N), where 
S, H are respectively independent of S, H and with each entry Nc(0,cr 2 ) distributed, given any K > with 
eigenvalues Ai > ■ ■ ■ > Am > 0, we have 

NEg log dct(I + SKS") - ME A log dct(I + HKH ) < —N(M - N) \og{a 2 ) + 0(1) (52) 

as a 2 goes to 0. 

Proof: See Appendix B. ■ 
According to the Markov chain X% -> (Y^, YJ 1 ) — > Y™, we upper-bound the weighted sum rate as 

Rl + (53) 



N M 

^nlogP + ^f^MViW.teWIUW.HWJ-iMViWIUW.HW)) +n-0(l)+ne n (54) 



t=i 



/ M — N \ 

<n(l+ M a) logP + n-(9(l) + ne„ (55) 

and similarly obtain another outer bound by exchanging the roles of Rx-1 and Rx-2. Accordingly, the corresponding 

outer bounds of the DoF are obtained by the definition. 

2) Case M > 2N: Given p = 2N and q = N, we have 

^h(Vi(t),W2(*)|U(t),-ff(*))- ^h(jh(t)\U{t),H{t)) (56) 

<E dm max max E smSm (logdet(I + H 22 (t)K u (t)H^ 2 (t)) - logdet(I + ffi 2 (t)K„(t)ffr 2 (*))) (57) 

C^O, p(S(t)) 'IN 

tr(C)<P if„( t )XC 

< ±alogP + 0(l) (58) 

where (57) follows from the inequality det(I + A) < det(I + An)det(I + A22) where A is an 21 x 2i positive 
semidefinite matrix and An,A 2 2 are two I x Z submatrices on the diagonal without overlap; and the last inequality 
follows the lemma: 
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Lemma 2. For an N x M random matrix (M > N) H = H + H where H with each entry 3Mc(0, a 2 ) distributed 
is independent of H, given any K > with eigenvalues \\ > ■ ■ ■ > Am > 0, we have the upper and lower bounds: 



N 

Ejj log det(I + HKH") < ^ log(l + (\\H\\j + MN)\) + 0(1), (59) 

i=l 
N 

E A log det(I + HKH") > ^log(l + A iC 7 2 e c ) + 0{l). (60) 

i=l 

The difference of the upper and lower bounds can be further bounded by 

log(l + (|| JT||f + MN)\) - log(l + \ i( j 2 et) < - log(a 2 ) + 0(1) (61) 

where ( = J2iLi ip{N—i+l) and i[)(x) is the digamma function that given by [17] tp(x) = — J+J2p=i \ — l na; +l 
for integer x, where 7 is Euler's constant. 

Proof: See Appendix C. ■ 
In this case, the weighted sum rate can be bounded as 

n(2i?x + R 2 ) (62) 

n 

<n-2NlogP + Y, [h{yi(t),y 2 (t)\U{t),H{t)) - 2h(y 1 (t)\U(t), H(t))) + n ■ 0(1) + ne n (63) 
t=i 

< nN(2 + a) log P + n ■ 0(1) + ne n (64) 

and another outer bound can be similarly obtained by exchanging the roles of Rx-1 and Rx-2. Accordingly, the 
corresponding outer bound of the DoF region is obtained by the definition. 

V. ACHIEVABILITY 

With delayed CSIT, MAT alignment employed in [3-6] takes into account the completely outdated CSIT, regardless 
of the correlation between current and previous channel states. In fact, however, the current CSI is predictable 
from the past states if the underlying channel exhibits some temporal correlation. By employing power allocation, 
superposition coding, and successive decoding techniques, recent results [10, 11, 13] demonstrated that the DoF 
region can be enlarged, where the benefit of the estimated current CSIT is fully exploitable even it is imperfect. 

In the following, we extend such techniques to symmetric MIMO case, and construct a set of achievable schemes 
according to antenna configuration. In view of the similarity of achievable schemes of BC and IC, we only describe 
the corresponding schemes for IC avoiding redundancy. Three cases are considered: 

. Case A: M < N 

. Case B: N <M <2N 

. Case C: M > 2N 
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A. Case A: M < N 

In this case, the DoF region with no CSIT coincides with that with perfect CSIT [7-9], indicating the delayed 
or current CSIT is of nothing benefit. In particular, all the DoF regions are achieved by simple time division and 
sharing. 

B. Case B: N < M < 2N 

To make the achievable schemes easily followable, we detail a particular setting with M = 3 and N = 2, where 
two vertices 

Pb!: (2, a), P B2 : (°±^, ^) (65) 

are achievable as shown in Fig. 2. We relegate the generalized version to the Appendix D. 
Before proceeding further, we define 

Qy(t)eC 3x2 c ^.(t) e c 3xl c ^(t)}, (66a) 

W(t) eC 3xl CJt{H 2 u(t)}, W ± (t) e C 3x2 C N{H 2n (t)}, (66b) 

where H 2 u(t) is the first row of H 2 i(t). Note that the range and null spaces represent column space for 
notational conciseness. Note also that W(t) and W (t) lie in the range and null spaces of the first row of 
H 2 i(t), respectively. According to the definition of imperfect current CSIT, we have K\\Hji(t)Qji(t)\\p ~ P~ a 
andE||jy2ii(t)W- L (t)||?~i , - a . 

Achievability of Pbi ■ (2, a) 

This vertex can be achieved within one single slot (we drop time index here for conciseness) by transmitting 
codewords with superposition coding [19] as 

x 1 = Quu c + W^-Up, x 2 = Qi 2 v p (67) 

where u c e C 2xl with each symbol of rate (1 — a) \ogP is decodable by both receivers (referred to as common 
message) but only desirable by Rx-1, and u p e C 2xl , v p with each element of rate alogP can only be seen by 
its respective receiver (referred to as private messages), satisfying power constraints E||it c || 2 < P and E||w p || 2 = 
E|«p| 2 < P a . At the receiver side, we have (with noise terms being omitted hereafter for conciseness) 

V\ = HnQuUc + H 11 W A -u p + H 12 Q^ 2 v p , (68a) 

P pa pa 

y 2 = H 21 Q llUc + H 21 W ± u p + H 22 Q± 2 v p , (68b) 

v v ' ' 

P Pa 

2/2i = H 211 QnU c + H 211 W^u p + H 212 Q^ 2 v p , (68c) 
s v ' ^ ' ^ ' 

P p(> pa 

where y 2 i is the first element of y 2 . The common (cf. u c ) and private (cf. u p and v p ) messages are recoverable by 
successive decoding [19], which takes three steps: (i) retrieving 2x1 vector u c first with all receive antennas by 
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treating others as noise, from 2 linearly independent equations each [cf. (68a)-(68b)], yielding in total 2(1 — a) 
DoF desired only by Rx-1; (ii) reconstructing the whole terms HuQuu c in (68a) and H 2 uQuu c in (68c), and 
consequently subtracting them from received signals; (iii) recovering the private messages u p (resp. v p ) from the 
residual signal in (68a) at the Rx-1 [resp. in (68c) at the Rx-2], yielding 2a DoF for Rx-1 (resp. a DoF for Rx-2). 
We point out that, in the last two steps only the signal at the first antenna of Rx-2 is considered [cf. y 2 iL where 
the interference arisen from u p is pre-canceled by precoder W ± . It is worth noting that the numbers of private 
messages for both receivers are imbalanced due to the limited dimensionalities, i.e., M < 2N. 

In the end, Rx-1 achieves the total 2 DoF while Rx-2 obtains a DoF. By exchanging the roles of the two receivers, 
the DoF pair (a, 2) can be achieved in a similar way. 

Achivability of P B2 : S=s) 

Similarly to the case with pure delayed CSIT [5,6], (M + N) = 5 time slots are required to achieve such point, 
but in a much more involved way. In the following, we first provide a variant of MAT alignment, followed by a 
new scheme integrating the imperfect current CSIT. 

1) A Variant of MAT Alignment: With M — 3 and N — 2, the original MAT alignment for MIMO IC can be 
achieved in a different fashion with two phases: data symbols are sent all together in the first phase whereas the 
overheard interferences are forwarded in the second phase, as illustrated in Fig. 3. 

Phase-1 (N = 2 slots): The transmitted signals can be given by 

X!(t) = u{t), x 2 {t)=v{t), t = l,2 (69) 

where u(t),v(t) € C 3xl . The received signals of N = 2 antennas at the RX-1 and the first M — N = 1 antenna at 
the Rx-2, where in total M = 3 receive antennas are sufficient, are given by 

yi(t) = H n (t) Xl {t) + H 12 {t)x 2 {t), (70a) 

S v ' 

y 21 (t) = H 211 (t) Xl (t) +H 212 (t)x 2 (t), (70b) 

s v ' 

nil 

where ijj — [77^ rjj 2 } T . Note that after two slots the Rx-1 requires {r/j fc , t,k = 1, 2} to cancel the interferences as 
well as another M — N — 1 linearly independent equation which can be obtained by {rfti,t — 1, 2} to retrieve u(t). 
On the other hand, the Rx-2 requires {77^, i = 1, 2} to cancel the interferences at the first (M — N = 1) antenna as 
well as another N = 2 linearly independent equations provided by {r)l k ,t, k = 1, 2} to retrieve v(t). After Phase-1, 
the transmitter has access to Hij(t) (t — 1,2). Together with the transmitted symbols, it can reconstruct the total 
MN — 6 interference terms jj^, r)\ 2 , r] 21 , r] 22 , T]^, t] 21 . All of them will be delivered in the following M = 3 time 
slots, where in each slot N = 2 terms are sent. 

Phase-2 (M = 3 slots): Before forwarding the interference terms, we concatenate them into M — 3 vectors, e.g., 
Vi — [vli V21Y' V2 — [V12 V22Y an ^ Vi — [Vii V2iV- With such interference grouping, we construct the transmitted 
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signal as 

*i(3)=0 faji(4)=0 f aji(5)=«(5)^ 

as 2 (3) = *(3)^} '\ aj a (4) =«(4)^ '} xj(5) = 
where <&(<) e C 3x2 (t = 3, 4, 5) are random matrices, as shown in Fig. 3. With N = 2 antennas at each receiver, 
fjl (i = 1, 2) can be successfully recovered. As stated above, with the required interference terms as well as sufficient 
linearly independent equations, the transmitted symbols in Phase- 1 are all recoverable, yielding ^f_^ N = § DoF for 
each Tx-Rx pair. 



Slot-l 



Slot-2 



Slot-3 



Slot-4 



Slot-5 



5 E 



01 

5 E 



u lO 

< 







00 







00 



a a 



a 



a a 



□ 



a 



-> time 



_ a 

1 -a 



1 -a 



[ | Rx-l's symbol Q Rx-2's symbol 



interference 



interference 



Fig. 3: Illustration of the difference between MAT alignment and the achievable scheme for (3, 2) MIMO IC, where 
the height of the brick indicates the data rate in terms of log P. 



2) Integrating Imperfect Current CSIT: With the imperfect current CSIT, the new transmission scheme follows a 
similar step to the aforementioned MAT variant, but together with the philosophy developed in [10, 11]: 

• Precoding and proper power allocation: Instead of sending the crude symbols like in (69), we now exploit the 
spatial precoding to send two symbols to each user as 



xi(t) 
x 2 (t) 



u(t), 
v(t), 



4= 1,2 



(72) 



W(t) W ± {t) 

Qi2{t) Qi 2 (t)_ 

and assign P 1 ^", P. P to three symbols in u(t) whereas P 1_a , P to three symbols in v(t), as 

shown in the first two blocks of Fig. 3. It follows that the effective rates of u(t) and v(t) in each slot are 
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(3 — a) log P and (3 — 2a) log P, respectively. It therefore generates MN = 6 interferences in total, i.e., 

nj = H 12 (t) [ Ql2 ( t) Q ±(t)] v(t), t = i 2 

Iti = H 2 n(i) [w(t) W-'-(t)] «(*), 

where Qi 2 (i) e C 3x2 , Q^i) e C 3xl , W(t) e C 3xl , W-'-(i) e C 3x2 , ^ e C 2xl and ^ is a scalar. 
Recalling the property of the precoders in (66), we know that the interference terms r/l and r\^ x in Phase- 1 
have a power scaling as P 1- ". As did in the previous subsection, we pack these interferences into M = 3 
vectors: fj{ = [j^ 77^] T , tj| = [r]\ 2 r/l 2 ] T , and J7 2 = [77^ ?7 2 ! 1 ] T . 

Quantization and digital retransmission: Instead of forwarding ff k (i = 1,2) directly, we quantize them with 
source codebook size of (1 — a) \ogP bits each, which makes the quantization error negligible in terms of 
DoF, and then retransmit their indices, denoted by rf k (i = 1, 2), in a digital fashion. The feasibility of such 
digital transmission is demonstrated in details in [11]. 

Superposition coding and successive decoding: On top of codewords of the digitalized interferences f]\ (i = 
1, 2) with each element of rate (1 — a) logP (cf. common message), we encode another fresh symbols (cf. private 
message) with rate alogP each by using superposition coding techniques [19], i.e., 

x 1 (3) = W ± (3)u p (l) f xi (4) = W- L (4)« p (2) f a;i(5) = $(5)r) 2 + W^{Z)u p 

x 2 (3) = $(3)^1 + QU3)v P (l) { a 2 (4)=*(4)$+C#- 2 (4)t; p (2) { x 2 (5) = Q± 2 (5)v p (3) 
where u p (s) £ C 2xl , s = 1, 2, 3 and w p (s) is a scalar. Note that the power allocation satisfies E||i7^,|| 2 < P 
whereas E||w p (s)|| 2 < P a and E|i;p(s)| 2 < P a . Thus, the received signal in Slot-3 can be given by (those in 
other slots can be similarly obtained) 

yi (3) = H 12 (3)^(3)vl+H 11 (3)W ± (3)u p (l) + H 12 (3)Qi 2 (3)v p (l), (74a) 

^ V ' v ' v ' 

P pa p 

1/2(3) = i?22(3)$(3)r) 1 1 +Jf 21 (3)W ± (3) Mp (l) +if 22 (3)Q^ 2 (3)« p (l), (74b) 

^ V ' S v ' 

P Pa 

y 21 (3) = J ff 212 (3)$(3)j7 1 1 +i/ 211 (3)l^ ± (3)t tp (l) + J Ff 212 (3)Q^ 2 (3) Wp (l). (74c) 

S V ' " v ' S ^ ' 

P pO pa 

At the receivers, by successive decoding described in the previous subsection, both the common (cf. r)\) and 
private [cf. u p (l) for Rx-1 and v p (l) for Rx-2] messages can be successively decoded, yielding 2a (resp. a) 
DoF from private messages u p (l) for Rx-1 [resp. v p (l) for Rx-2]. Those for Slot-4 and Slot-5 can be similarly 
deduced. In the end, the codeword vector rj[, f\\ and i)\ can be all recovered (and in turn r)\ and 77^ with 
negligible quantization error) at both receivers, serving to cancel the overheard interference as well as to 
provide additional linearly independent equations for u(t) and v(t), t — 1,2. Both with MN — 6 linearly 
independent equations, the vectors u(t) and v(t) (t = 1, 2) with total MN = 6 symbols each are recoverable 
at its respective receiver. As illustrated in Fig. 3, the DoF pair ( 6+ 5 4a , ^ip) is therefore achieved. 
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C. Case C: M > 2N 

Due to the fact that additional transmit antennas do not increase DoF as long as M > 2N in both perfect 
instantaneous and delayed CSIT setting [2,5], we consider M = 2N case here for illustration. With for instance 
M = 4 and iV = 2, we demonstrate the outer bound in Theorem 1 is tight (as Theorem 2 claimed), where the 
vertices 

4 + 2a 4 + 2a \ 

3 ' 3 J ( ' 

and their counterparts (with exchanged roles of both receivers) are all achievable as shown in Fig. 1 . Note that this 

case coincides with the MISO case except for the multiplication factor of 2. We therefore relegate the generalization 

to the Appendix E. 

Following the similar definition, we have 

Qij(t) e C 4x2 c X{Hij(t)}, Q^(f)eC 4x2 cw{4(t)} (76) 
v,hettK\\H ji {t)Qf i (t)\\^P- a . 

Achievability of Pqi '■ (2, 2a) 

This vertex can be achieved within one single slot. The transmission with superposition coding can be given by 

xi = Qnu c + QiiUp, x 2 = Qi 2 v P ( 77 ) 

where u c e C 2xl is a common message and decodable by both receivers but only desirable by Rx-1, and 
u p , v p e C 2xl are private messages which can only be seen and decoded by their corresponding receivers. These 
transmitted symbols are assumed to satisfy the power constraints E||it c || 2 < P with each symbol of rate (1 — a) logP, 
and E||itp|| 2 = E||t) p || 2 < P a with each symbol of rate a log P. At the receiver side, we have 

yi = HnQnUc + HnQ^Up + H 12 Qi 2 v P , (78a) 

v ' ^ „ ' v ' 

P pa p(> 

y 2 = H 21 Q n u c + H 21 Q 21 u p + H 22 Q\ 2 v p . (78b) 
v ' ^ „ ' v ' 

P pO pa 

By successive decoding, both the common (cf. u c ) and private (cf. u p , v p ) messages can be subsequently decoded, 
yielding total 2 and 2a DoF for Rx-1 and Rx-2, respectively. 

Achievability of P C2 : ^) 

Differently from the Case B, three time slots are sufficient to achieve such a vertex, which can be straightforwardly 
extended from the MISO case [13]. 



Pci : (2,2a), P C2 : 
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Slot-1: The transmitted signals from both transmitters are given by 

= [g 2 i(l) Q^i(l)] "(1), (79a) 
*2(1) = [Qi 2 (1) Qi 2 (i)} ™ 
where = [u[(l) u T 2 (l)]\ v(l) = u5(l)] T with m,(1),^(1) e C 2xl . At both receivers, we have 

Vi(l) = ffn(l)[Q 2 i(l) gi(l)l«(l)+ffi2(l)[Qi 2 (l) Qi-(1)1^(1), (80a) 



ifa(i) = fr 2 i(i) 



Q2l(l) 021(1) 



u(l)+H 22 (l) 



Ql2(l) Ql2(l) 



«(!)> 



(80b) 



where rf e C 2xl , i = 1,2. By balancing the allocated power among those vectors, i.e., E||m 1 (1)|| = E||«i(l)|| <~ 
P 1_Q and E||tt2(l)|| 2 = E||t>2(l)|| 2 = P — 2P 1 ~ a <~ P, we approximate the total power of interference as 
E||?7i|| 2 ~ pi-a anc [ E||j7 2 || 2 ~ P 1_Q . Accordingly, the effective transmission rate of Slot-1 is (4— 2a)logP 
for each Tx-Rx pair. As did in the previous section, the interferences are first quantized and then retransmitted in 
the digital fashion. Denoted by f)i (i = 1, 2) the indices of the quantized version, the digitalized interference are 
forwarded in the ensuing two slots. 

Slot-2 & Slot-3: These two slots are similar. In Slot-2 (resp. Slot-3), apart from the forwarding of the digitalized 
interferences 172 (resp. 771), the Tx-1 (resp. Tx-2) sends a new symbol vector with superposition coding, while only 
a new symbol vector at the Tx-2 (resp. Tx-1). The transmitted signals in two slots are 

*i(2) =g 2 i(2)J7 2 + Q 2 L i(2H2) 
x 2 (2) = QU2)v(2) 

: e 1 (3) = Q 2 l 1 (3)m(3) 
^2(3)-Qi 2 (3)r) 1 +Q^ 2 (3)t;(3) 



Slot-2: 



Slot-3: 



(81a) 



(81b) 



where ^eC 



2x1 



1, 2 are treated as common messages with each element of rate (1 — a) logP and power 



constraint E||r)ij| 2 < P, and u(2), v(2), u(3), v(3) € C are private messages with each symbol of rate alogP, 
satisfying E||w(t)|| 2 =E||i;(t)|| 2 < P a , t = 2,3. 

In Slot-2, the received signals are give by (the received signals in Slot-3 can be similarly obtained) 



yi(2) = H 11 (2)Q 21 (2)r l2 + H 11 (2)Qi 1 (2)u(2)+H 12 {2)Qi 2 (2)v(2), 

s v ' " v ' s v ' 

p P a p° 

y 2 (2) = H 21 (2)Q 21 (2)fj 2 + H 21 (2)Q 2 L 1 (2)«(2) + H 22 (2)Q^ 2 (2)v(2) . 



(82a) 
(82b) 



p p0 pa 

By successive decoding described in the previous subsection, both the common message (cf. r) 2 ) and the private 
messages [cf. u(2) for Rx-1 and v(2) for RX-2] are retrievable provided 2 linearly independent equations at their 
respective receivers. The same strategy applies to Slot-3. 

In the end, 2x1 codeword vector 772 and 771 can be all recovered (and in turn T71 and 772 with negligible 
quantization error) at both receivers, serving to cancel the overheard interference as well as to provide additional 
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linearly independent equations for v(l) and u(l), respectively. With 4 linearly independent equations, the 4 x 1 
vectors u(l) and v(l) are both recoverable with high probability at their respective receivers. Hence, the total 
(4 + 2a) DoF for each receiver is achieved within three time slots, and in turn Pc2 is achievable by symmetry. 

VI. Conclusion 

In this work, we focus on the two-user MIMO broadcast and interference channels where the transmitter(s) 
has/have access to both delayed CSIT and an estimate of current CSIT. Specifically, the DoF region of one important 
MIMO scenario offered by the same numbers of transmit and receive antennas is characterized, relying on the 
forwarding to users of a suitably quantized version of prior interference obtained under imperfect linear ZF precoding 
at the transmitter(s). With multiple antennas at the receiver, the additional spatial dimension enables to cancel 
a certain amount of residual interference generated by the impreciseness of transmit precoding, and thus further 
enhances the achievable DoF. Our DoF region covers a family of CSIT settings, coinciding with previously reported 
results for extreme situations such as pure delayed CSIT and pure current CSIT. The extension to the more general 
MIMO cases with arbitrary number of antennas is still an open and challenging problem. 

Appendix 

A. Preliminaries 

Before proceeding further, we provide the following preliminary results. 

Let A e C JVxM , N < M, be a full rank matrix and A' € C NxM ' , M' < M, be a submatrix of A. We have the 
following lemmas that will be repeatedly used in the rest of the proof. 

Lemma 3 (rank of submatrix). 

rank(A') > rank(A) - (M - M'). (83) 
Lemma 4. Let 3\, ... ,3 m be a cyclic sliding window of size N on the set of indices {1, . . . , M}, i.e., 

J fe = {(fc + i) M + 1 :i€ [0.JV-1]}, fc=l,...,M. (84) 
If the columns of A are arranged such that rank(Aj fc ) = N for some k G [1, M], then 

M 

^rank(A 3 J>7V 2 (85) 
k=i 

where Aj k is the matrix composed of N columns of A defined by 3k, i.e., Aj k = \-A.j,i]je[i,N],ieJk- 

Proof: The sketch of the proofs for the above lemma is illustrated in Fig. 4(a). Given that there exists k such 
that the submatrix selected by the window is full rank N (the blue window in Fig.4(a)), the rank of the submatrix 
selected by the window 3 fe+1 or 3^-1 (the red window in Fig.4(b)) is lower bounded by N — 1. By applying the 
same argument, it is readily shown that the rank of the submatrix selected by the window 3k+2 or 3^-2 is lower 
bounded by N — 2. This lower bound keeps decreasing when the window slides away from the blue one, until it hits 
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another lower bound N — (M — N) = 2N — M given by Lemma 3. The submatrices within the sliding windows 
are of rank 2N - M, which lasts M - 1 - 2(M - N) = 2N - M - 1 times. With the help of Fig.4(a), a lower 
bound on the sum of the ranks of all the submatrices visited by the sliding window, can be obtained by counting 
the dots in the Figure, i.e., 

M-N 

N + 2 (N — i) + (2N — M)(2N - M + 1) = N 2 . (86) 

i=l 

In fact, this can be found easily by "completing the triangle", the number of dots in which is iV 2 . ■ 

Lemma 5. A' e C^"* , N < M' < M, is a submatrix of A. We define . . . , 7 M , as a cyclic sliding window 
of size N on the set of indices {1, . . . , M'}, i.e., 

3' k = {{k + i) M ' + l :*€ [0,iV-l]}, k = l,...,M'. (87) 
If the columns of A' are arranged such that the first rank( A') columns of AL are linear independent for some 

k 

k e [1, M], then we have 

M' 

^2 rank(A'j, ) > N(N - (M - M ')) (88) 
fe=i 

where A' 3 , is the submatrix of A' with N columns defined by 3' k , i.e., A'j, = [A^ ■. i\je[i.N].ieJ' k - 

Proof: The sketch of the proofs for the above lemma is illustrated in Fig. 4(b). Given that there exists k 
such that the submatrix selected by the window has rank r = N — (M — M') given by Lemma 3 and that the 
first r columns are linearly independent (the blue window in Fig.4(b)), the rank of the submatrix selected by the 
windows 3' k _ 1 , . . . ,3' k _( N _ r ) (the red and brown windows in Fig.4(b)) is lower bounded by r — 1. This lower 
bound keeps decreasing when the window slides go away from these positions, until it hits another lower bound 
— (M ~ N) = 2N — M given by Lemma 3. With the help of Fig.4(b), a lower bound on the sum of the ranks of 
all the submatrices visited by the sliding window, can be obtained by counting the dots in the Figure. In fact, after 
some basic computations, it turns out that there are N(N — (M — M')) dots. ■ 

Lemma 6. Let H e C NxM , N < M, with H = H + H. If H has i.i.d. K c (0, a 2 ) entries, then 

log det(HH") = (N- rank(ff )) log a 2 + 0(1) (89) 

as a 2 goes to 0. 

Proof: The proof starts by rewriting the non-central Wishart Matrix HH H with H = H + H. By SVD, the 
deterministic matrix can be decomposed as H = UkV H , where A € C NxM . Thus, we have 

HH" = U (AA T + U H HVA T + AV"H H U + U H HH H U^j U H (90) 

= U (AA T + <jGA t + a AG" + a 2 GG H )U H (91) 

where G = ±U H HV e C NxM with each entry distributed as N c (0, 1). 
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rank(A 3l ) 



N 



1 * j 



• • • 



2N-M 



--4- 

I 

I 



I 
I 

-V 



M 



(a) An example for Lemma 4 



rank(A 3i ) 




1 M' M 

(b) An example for Lemma 5 



Fig. 4: Illustrations of the worst-case ranks of the submatrices from a sliding window. For each k, the number of 
vertical dots represents the rank of the submatrix Aj k . In particular, the number of red (resp. blue) dots is the rank 
of the submatrix selected by the red (resp. blue) window. The sum of the ranks can be found by counting the 

number of dots. 



Let B = GG H be divided into block format as B = f 2 J with B 1 e C rxr and B 4 e c(N-r)x(N- r )^ where 
r = rank(H'). Similarly, let G = [ %\ %\ ] with G x € C rxr and G 4 € £{N-r)x(M-r) and Ai = diag{A 1; • • • , A r }. 
Then, we have 



\ogdet(HH") = logdet(AA T + aGA T + crAG" + a 2 GG H ) 

A 2 + ctGiAi + a 2 B x aA 1 G" 3 + a 2 B 2 



where 



log det 



cG 3 Ai + <r 2 B 3 



a 2 B 4 



(92) 
(93) 



log[det(a 2 J B 4 )d G t(A2 + aG^ + a 2 B 1 - (aA^ + a 2 B 2 )(a 2 B i )- 1 aG 3 A 1 + a 2 B 3 )] 



= log det (<r 2 B A ) + log det (Xi + aX 2 + a 2 X 3 ) 

= log det (cr 2 ^) + log det (Xi) + log det (I r + oX^ 1 X 2 + o 2 X^ 1 X 3 ) 
= log det (cr 2 ^) -flog det (Xi) +0(1) 

Xi =A 2 +A 1 G^ 4 - 1 G 3 A 1 

X 2 = G\Ai — AiG 3 B 4 1 B 3 — B 2 B 4 1 G 3 A 1 

X 3 = Bi — B 2 B A 1 B 3 



(94) 
(95) 
(96) 

(97) 
(98) 
(99) 



are all random matrices independent of a, and the last line is according to the approximation of Taylor expansion 
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for the determinant, i.e., 

det(I + eX) = 1 + etr(X) + 0(e 2 ) = 0(1) (100) 

when e goes to 0. 

Since B 4 and Xi are made of Wishart matrices, both Ejj logdet(B 4 ) and Efj log det(Xi) exist. Hence, we 
have 

Efj log det (i?ff H ) = Eh log det(cr 2 B 4 ) + E^ log det(Xi) + 0(1) (101) 
= (N - r) \og(a 2 ) + 0(1) (102) 

where B A e C ( - N -^ x ^ N ~ r \ This completes the proof. 

■ 

B. Proof of Lemma 1 

Let us define M' as the number of eigenvalues of K that are bounded from 0, i.e., > O(l), as P grows large, 
and let K = VAV H be the eigenvalue decomposition of K. We first establish the following upper bound: 

det(I + SKS") = det(I + 9A9 H ) (103) 
<dct(I + 6'A'(0') H ) + O(l) (104) 
<det(I+||e'|||A') + 0(l) (105) 

where G = SV; G' is a submatrix composed of the M' first columns of G and A' is a diagonal matrix composed 
of the M' largest eigenvalues of K. Therefore, we have 

Eg logdet(I + SKS") < log det(I + A') + O(l) (106) 
< logdet(A') + 0(1) (107) 

where the last inequality holds since each eigenvalue of A' is bounded from 0. 

Let $ = HV, <&' = HV. Without loss of generality, we assume that the columns of $ and <&' are arranged 
such that the conditions in Lemma 4 and Lemma 5 are satisfied (i.e., rank($j) = 7Y, where J is the cyclic window 
with size 7Y, and <&j is defined in the Appendix A), respectively. This also implies that the eigenvalues in A and A' 
are arranged accordingly. The lemma is solved for different values of M'. 

1 ) Case M' = M : In this case, we have 

det(I + HKH H ) = det(I + *A$ H ) (108) 
= det(A a )det(*i;$3) (109) 

JC{1,...,JV} 
M 

>2det(»5 fc *,Jdet(AjJ (110) 

fc=i 
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where (109) is according to eq-(16) in [18], and the lower bound is obtained by only considering a sliding window 
of size N for all the possible sub-determinant. Further, 



M 



logdet(I + HKH") > log ^det(*3 fc *jJdet(AjjJ (111) 

^ lo s ^E dct (*^*^) dct ( A ^)) ( 112 > 

^ M lo S ^ndet(*3 fc * 3 Jdet(A a J^ (113) 

= ± ^Vlogdet(A) + 2logdet(*3 lb *jJ^ (114) 

where (111) is from (110); (113) is from the AM-GM inequality; the last equality is from the sliding window 
property Jlfcli det(Aj fc ) = det(A) Ar . Finally, we have 

E^logdet(I + iIKIf H ) > [ iVlogdet(A) + ^E^logdet($3 fc * a J j (115) 



k=i 



> 
~ M 



X - ^logdet(A) + lo g( 7 2 ^(iV-rank(* a J)+0(l)^ (116) 

= ^ (n log det(A) + log a 2 (^MN - ^ rank (*a ^ (1 17) 

> ^ (log dct(A) + (M - TV) log a 2 ) (118) 

where $ = ffV and hence rank(<&) = rank(.ff); (116) is from Lemma 6; the last inequality is from Lemma 4. 
Combining (107) and (118), we have 

iVEglogdct(I + SK'S' H )-ME f jlo g dct(I + i?KH H ) < -N(M - N) log cr 2 + 0(1). (119) 

2) Case M > M' > N : For this case, we can first get 

det(I + HKH" ) = dot (I + $A$ H ) (120) 

>det(I + *'A' (*')")■ (121) 
Following the same foot steps as in the previous subsection, we can obtain 



Ejj logdet(I + HKH") > ^— ( iVlogdet(A') + loga 2 ( M'N - ^rank^) ] ] (122) 



M' 



M 
N 



fe=i 



> — (log det(A') + (M — N) log a 2 ) (123) 
where the last inequality is from Lemma 5. Combining (107) and (123), we have 

NEg log dct(I + SKS H ) -ME A logdet(I + HKH H ) (124) 

< jVEglogdet(H-SKS H )] - M' E 6 log det(I + HKH H )} (125) 
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< -N(M - N) log a 2 + 0(1). (126) 
3) Case M' < N : From (121) and given that M' < N, we have 

Ejj logdet(I + HKH") > logdct(A') + logcr 2 (m' - rank(* )) (127) 

> logdct(A') + logcr 2 (M' - (N - (M - M'))) (128) 

= logdct(A') + (M - N) logo- 2 (129) 
where (128) is from logcr 2 < and rank($) > N - (M - M 1 ). Combining (107) and (129), we have 

N Eg log det (I + SKS" ) — ME^ log dot (I + HKH H ) (130) 

< TV Eg log dot (I + SKS H ) - TVEjj log det (I + HKH") (131) 

< -N(M — N) logcr 2 + 0(1). (132) 
The above derivation considers all the possible relationships between M' and N, and therefore completes the proof. 

C. Proof of Lemma 2 

For the upper bound, we have 

E„logdct(I + HKH") =Efj\ogdet(I + U H i: H V^KV H i: H U^) (133) 

= E^logdct(I + £ 2 J V£KV H ) (134) 

<E ff logdet(I + A max (S 2 f )VSKV H ) (135) 

N 

= ^E^log(l + A max (iriT H )A i (V£ii:y ff )) (136) 



AT 



<5>ttlog(l + A max (iJiT)A 4 ) (137) 

i=l 
AT 

^^E^^l + UffH 2 ^) (138) 
^^E^loga + dl^lli + llffH 2 )^) (139) 

<^log(l + (|| J ff|| 2 + M7Vcr 2 )A l ) (140) 

i=i 

AT 

< ]T log(l + || 2 + MN)X t ) + 0(1) (141) 



i=i 



where in (133), if = U h ^hV^ with S w e C NxN and G C MxAr ; (134) comes from the equality 
det(I + AB) = det(I + BA); (137) is due to Poincare Separation Theorem [20] that \{V^KV H ) < \{K) for 
i = 1, • • • , TV; (138) is from the fact that ||i?|| 2 < ||-H1|f; (140) is obtained by applying Jensen's inequality to a 
concave function. 
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For the lower bound, we have 

E 6 logdet(I + HKH") > E^iVlog (l + dct(HKH") 1/N ^ (142) 

= E^iVlog M + exp ( i In det(HKH H )j J (143) 



> iVlog ( 1 +cxp ( ^E^lndet(H7<:iJ H ) ) ) (144) 



where (142) comes from Minkowski's inequality and (144) is from Jensen's inequality by noticing that log(l + e x ) 
is a convex function in x. Hence, the expectation part can be simplified further as 

E / jlndct(J?K J ff H ) =E^lndet(*A* H ) (145) 
> E f jlndet(*'A'*' H ) (146) 

N 

= ln]jA 4 +E f jlndct(*'*' H ) (147) 



i=i 

N 



> ln^A.+E^lndet (hV'V"H"^ (148) 

i=l 

JV 

= ln]jA 4 +E^lndet(if^ff' H ) (149) 

i=l 

N N 

= lnI](A j( 7 2 ) + ^</;(A^ + l) (150) 

i=l i=l 

where <& = HV <E <C NxM with V being the unitary matrix containing the eigenvectors of K, i.e., K = V AV H 
with A = diag(Ai, • • • , Am); in (146), = HV E C NxN with V being the first N columns of V where 
A' = diag(Ai, • • • , Ajv) and #A# H >; 3>'A'«I>' H ; (148) is due to the fact that the capacity of a Ricean fading 
channel is no less than that of a Rayleigh fading channel in all SNR region [21], by treating H as the deterministic 
line-of-sight component; (149) is because of the i.i.d. assumption of H, which is independent of V , and H' is 
an N x N matrix with each entry having the same distribution as that of H; the last equation is according to the 
distribution assumption of H and obtained from [17]. 
By the definition ( — jf 4>{N — i + 1), we have 

iVlog^l + exp(^E^lndet(JfKi? H )^ > ATlog ^1 + exp ^lnJ|(Ai<r 2 e c )^ (151) 

1 N 

>N-J2H(K<7 2 e<)] + (153) 

N 

>^log(l + A;<7 2 e c )-iV (154) 
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N 



^log(l + A;<7 2 e c ) + 0(l) (155) 



where (x) + = max{£,0} and last inequality due to the fact (log(a;)) + > log(l + x) — 1. 
The difference between upper and lower bounds can be further bounded as 

2 



log(l + {\\H\\i + MN)\.) - log(l + A,„V) < log ' + ^^j^ff^ ) (156) 



< -log(a 2 )+0(l) (158) 



where the second inequality is due to the fact log (i+f ) < log (l + §) and the last inequality is because both 
(H-ffllp + MN) and are bounded. This completes the proof. 

D. Generalized Achievable Schemes for Case B (N < M < 2N) 

In this case, we demonstrate the DoF region stated in Theorem 3 is achievable, by showing the vertices 

\ r, f MN + N 2 a MN + (M 2 — N 2 — MN)a\ 
P m :(N,(M-N)a) P B2 : 4^ >-) (159) 

and their counterparts (with swapped roles of two receivers) are all achievable. 
Before proceeding further, we define 

QiiWeC^C^fi)}, Q+(i)eC Mx ( M -^CN{i^(i)}, (160a) 

W(t) e C Mx ( M - w ) C X{H 2[1:M - m (t)}, W ± (t) g C MxAr C nH 2[ i:M-N]i(t)}. (160b) 

Note that the range and null spaces represent column space for notational conciseness. According to the definition 
of imperfect current CSIT, we have EWH^Qj-^W 2 ~ P~ a and E\\H j[1:M _ N]i (t)W ± (t)\\ 2 , ~ p- a . 

Achievability of Pbi 



This vertex can be achieved within one single slot by transmitting 

xi = Qnu c + W ± u p , x 2 = Q{ 2 v p (161) 

where u c e C Wxl with rate N(l — a) logPisthe common message decodable by both receivers but only desirable by 
Rx-1, and u p e C Nxl , v p e £,(M-N)xi w j m reS p ect i ve rate f N a \ogP and (M — N)a\ogP are private messages 
that can be seen by its respective receiver, satisfying power constraints E||it c || 2 < P and E||tt p || 2 = E||r> p || 2 < P a . 
At the receiver side, we have 

yi = HiiQnti c + H 11 W ± u p + H 12 Q± 2 v p , (162a) 

s ' v- ' * v ' 

P pa p() 

y 2 = H 21 Q llUc + H 21 W^u p + H 22 Qi 2 v p , (162b) 
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V2[1:M-N] — H 2 [l:M-N]lQllU c + H 2 [1-.M-N]1 W ± U p + H 2 [i;M-N]2Ql2 v P> 



(162c) 



P pa pc 

where y 2 [i:M-N] is the first M ~ N elements of y 2 . By first treating lower power terms as noise, both receiver can 
decode N x 1 vector u c from (162a) and (162b) provided N linearly independent equations, yielding N(l - a) 
DoF desired only by Rx-1. After that, the whole terms HuQuu c in (162a) and H 2 [ 1:M _ N ]iQuu c in (162c) 
can be reconstructed, and consequently subtracted from both received signals. Finally, the private messages u p 
(resp. v p ) can be recovered from the residual signal in (162a) at the Rx-1 [resp. in (162c) at the Rx-2], yielding 
Not [resp. (M — N)a] DoF for Rx-1 (resp. Rx-2). We refer to this procedure as "successive decoding". In the end, 
Rx-1 achieves total N DoF while Rx-2 obtains (M — N)a DoF. By exchanging the roles of two receivers, the DoF 
pair ((M — N)a, N) can be achieved in a similar way. 



Achivability of Pp 2 

It takes N slots in the first phase and M slots in the ensuing phase. 

Phase-1: In this phase, N time slots are consumed to deliver MN symbols for each receiver. The transmitted 
signals in t-th time slot (t = 1, ■ ■ ■ ,N) are given by 

a?i(t) = \w(t) W- L (t)} u(t), 



352 (*) 



Qi 2 (t) QUt) 



V(t), 



where u(t),v(t) G C Mxl . The received signals at both receivers can be given as 
l/i (t) = Hn(t) \w(t) W ± (t)] u(t) + H 12 {t) \ Ql2 (t) Q±(t)] v(t), 



V2[l:M-N]{t) = H 2[1:M _ N]1 (t) \\V(t) W^it)] u(t)+H 2[1:M _ N]2 (t) \Q 12 (t) Qi 2 (t)] v(t), 



(163a) 
(163b) 

(164a) 
(164b) 



>1— a 



r.2 (z p(M-N)xl 
7 't[l:M-iV] t "- 

where only the first M — N elements of y 2 (t) are taken into account. By allocating «[i : m-jv] ~ -P 1 
M[m-jv+1:M] = P — (M — iV)^ 1 -" - P, t>[i :J v] - P 1 "", and v [N+1 .. M ] = P - NP 1 ^ ~ P, we have 
^{ll^tll 2 } ~ -P 1- " and E{|| ^[i-m-jv] II 2 } ~ P la < making it possible to compress them into (1 — a) \ogP bits for 
each element. Note that the efficient transmission rates are (M — (M — N)a) log P for Tx-1 whereas (M — TVc^logP 
for Rx-2, respectively. 

Before proceeding further, we concatenate all the interference terms as 

fil, k = !,■■■ ,N | 0. /,-= I. ■■■ ..V 



k 
,k 



N+l,--- ,M 



r?L w , fc = JV + l, 



,M 



(165) 



,k = 1, - • • ,M — N. Note that we abuse 



where tj£ = 
the notation of f) for simplicity. 

Instead of forwarding the interference terms in an analog fashion [3,5,6], we quantize them first with source 
codebooks and then forward the indices of codewords to both receivers. A set of source codebooks {X\ k ,t = 
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1, • • • , N, k = 1, • • • , M — N} and {X 2 fe , f = 1, ■ ■ ■ , iV, fc = 1, • • • , N} with (1 - a) log P bits each are generated 
to represent the quantized elements of interference vectors at the Tx-1 and Tx-2, respectively. The codewords 
representing such quantized elements, denoted by i) 2 k and r\\ k , are chosen uniformly from {X\ k } and {X 2 fc }. 

Phase-2: The objective of Phase-2 is to forward the indices of quantized version r)\ and Vt[vM-N]- Using 
Gaussian channel codebooks, the indices of fj k are encoded one-to-one to c l k , with rate of N(l — a)logP bits 
satisfying power constraint E||cj.|| 2 < P. Then, {c k } are forwarded as common messages to both receivers in the 
following M slots. It is worth noting that if rf k = 0, the corresponding transmitter keeps silent for it, i.e., c\ = 0. 

The transmitted signals at both transmitters are given by 

aji(t) = <f>(t)c 2 s + W^{t)u p {s), (166a) 
x 2 (t) = *(i)cj + Q± 2 (t)v p (a), (166b) 

where t = N + 1, • • • , M, s = t — N, $>(t) G £ MxN [ s a random matrix without lying in the null space of cross 
channels, u p (s) e C Nxl and v p (s) G c( M - N ) xl are private messages for Rx-1 and Rx-2 with rates iVlogP and 
(M - N)\ogP, respectively, satisfying power constraint E||m p (s)|| 2 = E||t! p (s)|| 2 < P a . 
The received signals at the receivers are given by 

y 1 (0 = -ffii(t)*(i)c 2 + i?i 2 (i)*(i)cJ+ J ffii(t)W ± (t)u p (s) + i?i 2 (i)Qi- 2 (t)v p (s), (167a) 

S , ' ^ v ' ^ v ' 

P pc po 

y 2 (t) = H 21 (t)®(t)c 2 s + H 22 (t)^(t)c 1 s +H 21 (t)W ± (t)u p (s) + H 22 (t)Qi 2 (t)v p (s), (167b) 

S « ' ^ v ' 

p pc* 

y2[l:M-N]{t) = H 2[1 . M _ N]1 (t)$(t)c 2 + H 2[1 . M _ N]2 {t)${t)c\ 

s v ' 

p 

:M-N]1 (t)W- L (t)u p («) + H 2[1:M _ N]2 (t)Qi 2 {t)v p {s) . (167c) 

v v ' " v ' 

po pc 

Note that either c\ or c 2 is in each equation according to (165). By treating other terms as noise, c\ can be 

decoded from (167a)-(167b) at both receivers provided the compression rate is not greater than N(l — a) log P. 

After that, the terms containing c\ is reconstructed and subtracted from the received signals in (167b) and (167c) 

to recover the private messages (i.e., u p (s) and v p (s)). With N (resp. M — N) linearly independent equations, 

the private messages u p (s) (resp. v p (s)) with total N (resp. M — N) symbols are retrievable at Rx-1 (resp. Rx-2) 

as long as the power constraints are satisfied, yielding additional Na and (M — N)a DoF for Rx-1 and Rx-2 

respectively in each slot. In the end, the codeword vector c\ can be all recovered (and in turn i) k ) at both receivers, 

serving to cancel the overheard interference as well as to provide additional linearly independent equations for u{t) 

and v(t). With sufficient linearly independent equations, the vectors u(t) and v(t) are both recoverable with high 

probability at its respective receiver. Hence, we have the achievable DoF pair 

N(M -(M - N)a)+MNa MN + N 2 a 

"l = = — T~ r — 5 (168a) 

M+N M+N 

N(M - Na) + M(M - N)a MN + (M 2 - N 2 - MN)a 

d2 = mTn = mTn • (168b) 
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By swapping the roles of Rx-1 and Rx-2, the counterpart point is also achievable. 

E. Generalized Achievable Schemes for Case C (M > 27V) 

As stated before, we consider M = 2N case since additional transmit antennas do not increase DoF [2,5]. In 
this case, we demonstrate the outer bound in Theorem 1 is tight (as Theorem 2 claimed), where the vertices 

Pcl : {N , Na) , P C2: (ffi±^,^) (169) 

and their counterparts (with exchanged roles of both receivers) are all achievable. Note that this case coincides with 
the MISO case except the multiplied DoF by N. 
Following the similar definition, we have 

Q^(t)eC 2NxN cX{H l3 (t)}, Q^.(t)eC 2ArxAr cK{^.(t)} (170) 
where K\\H ^Qjii^Wl ~ P ~ a - 
Achievability of Pci 

This vertex can be achieved within one single slot. The transmission can be given by 

xi = Qnu c + Q 21 u p , x 2 = Qi 2 v P ( 171 ) 

where u c e C Nxl , u p e C Nxl and v p e C Nxl . The transmitted symbols are assumed to satisfy the power 
constraints E||it c || 2 < P and rate N(l - a) logP, whereas E||m p || 2 = E||t>p|| 2 < P a and rate NalogP. Although 
the symbol u c is decodable by both receivers and hence referred to as a common message, it is only desirable by 
Rx-1. On the other hand, we refer to u p , v p as the private messages which can only be seen and decoded by their 
corresponding receivers. At the receiver side, we have 

j/i = HuQuUc + HnQ^Up + H 12 Qi 2 v p , (172a) 

V v ' ^- ^ ' ^ ' 

P pa p() 

y 2 = H 21 QuU c + H 2 iQ 21 u p + H 22 Q{ 2 v p . (172b) 

V v ' ^- ^ ' ^ ' 

P pO pa 

By successive decoding, the two receivers decode common message u c by treating other terms as noise. Then, 
subtracting the terms with u c , two receivers can decode their own private messages successfully, yielding total TV 
and Not DoF for Rx-1 and Rx-2, respectively. 



Achievability of Pci 

To achieve such a vertex, three time slots are sufficient. 
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Slot-1: The transmitted signals from both transmitters are given by 

= [<92i(l) Q^i(l)] 
^(1) = [Q 12 (l) Q^(l)] t;(l), 

where u(l) = [^(1) «5(1)] T , w(l) = [t>I(l) ^(1)]' with Ui (l), € C JVxl . 
At both receivers, we have 

Vl (i) = jyn(i) \q 21 (i) gi(i)lu(i) + fr 12 (i)[Q 12 (i) q£,(i)1 «(i), 



(173a) 
(173b) 



77iec Nxl 

ifa(i) = fr 2 i(i) [q 2 i(i) gi(i)l«(i)+H22(i)[oi 2 (i) Qi,(i)l«(i)- 



(174a) 



(174b) 



r 72 eC iv >' 



By balancing the allocated power among those vectors, i.e., E||iti(l)| = E||ui(l)| ~ P and E||it 2 (l)| = 
E||?j 2 (1)|| 2 = P — NP 1 ^" <~ P, we approximate the total power of interference vectors as E||t7i|| 2 ~ P 1- " 
and E||r7 2 || 2 ~ P 1_Q . A set of source codebooks {Xu, X 2i ,i = 1, ■ ■ ■ , N} with size (1 — a) \ogP bits each are 
generated to represent the quantized elements of the interference vectors q 2 an d r}\ at the Tx-1 and Tx-2, respectively. 
The codewords representing the elements of ij 2 and 771 are chosen uniformly from {Xu} and {X 2i } and concatenated 
as i) 2 and fji, respectively. Then, the indices of i)i are encoded to Ci with rate 7Y(1 — a) \ogP and power constraint 
E||ci|r<P using a Gaussian channel codebook, and then forwarded as common messages to both receivers in the 
following two slots. 

Slot-2 & Slot-3: These two slots are similar. The objective of the Slot-2 (resp. Slot-3) is to convey the coded 
common message c 2 (resp. c\) carrying the information of i) 2 (resp. 771), together with a new symbol vector at the 
Tx-1 (resp. Tx-2), while only a new symbol vector is sent at the Tx-2 (resp. Tx-1). The transmitted signals in two 
slots are 

Xi(2) = Q 21 (2)c 2 + Q 2 l 1 (2)m(2) 
x 2 (2) = Qi 2 (2)v(2) 

*i(3) = Q 2 L 1 (3)w(3) 
x 2 (3) = Q 12 (3) Cl + Qi 2 (3)v(3) 

where «(2), v{2), u(3), v(3) e C Nxl with rate NalogP each, satisfying E||m(2)|| 2 = E||w(3)|| 2 < P a and 
E||.;(2)|| 2 =E||t,(3)|| 2 <P«. 

In Slot-2, the received signals are give by (the received signals in Slot-3 can be similarly obtained) 



Slot-2: 



Slot-3: 



(175a) 



(175b) 



yi(2) = H 11 (2)Q 21 (2)c 2 +H 11 (2)Q 2 L 1 (2)u(2) + H 12 (2)Qi 2 (2)v(2), 

s v ' s v ' s v ' 

p pa pO 

y 2 (2) = if 21 (2)Q 21 (2)c 2 + ff 21 (2)Q^(2)u(2) + H 22 {2)Q^ 2 {2)v{2) . 
„ ' „ ' „ ' 

p pO pa 



(176a) 
(176b) 
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By treating other terms as noise, N x 1 vector c 2 is retrievable at both receivers with high probability provided TV 
linearly independent equations. After that, u{2) and v{2) are also recoverable from N linear equations at the Rx-1 
and Rx-2 by subtracting the terms with c 2 from the received signals. The same strategy applies to Slot-3. 

In the end, N x 1 codeword vector Ci and c 2 can be all recovered (and in turn i)i and 172) at both receivers, 
serving to cancel the overheard interference as well as to provide additional linearly independent equations for 
v(l) and u(l), respectively. With 2N linearly independent equations, the 2N x 1 vectors u(l) and v(l) are both 
recoverable with high probability at its respective receiver. Hence, the total N(2 + a) DoF for each receiver is 
achieved within three time slots, and in turn P C2 is achievable by symmetry. 
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